Multilayered extensions to the speech synthesis markup language for describing expressiveness
Authors
Abstract
In this paper we discuss possible extensions to the Speech Synthesis Markup Language (SSML) to facilitate the generation of synthetic expressive speech. The proposed extensions are hierarchical in nature, allowing specification in terms of physical parameters such as instantaneous pitch, higher-level parameters such as ToBI labels, or abstract concepts such as emotions. Low-level tags tend to change their values frequently, even within a word, while the more abstract tags generally apply to whole words, sentences or paragraphs. We envision interfaces at different levels to serve different types of users; speech experts may want to use low-level interfaces while artists may prefer to interface with the TTS system at more abstract levels.
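As a rough illustration of the three levels described above, the following sketch builds one SSML fragment in which an abstract emotion tag wraps a sentence, a ToBI accent tag wraps a single word, and a pitch contour varies within that word. Only `speak` and `prosody` (with its `contour` attribute) are standard SSML elements; the `x-emotion` and `x-tobi` tags, their attribute names, and the sample sentence are hypothetical stand-ins and are not the tag names proposed in the paper.

```python
import xml.etree.ElementTree as ET


def build_layered_ssml() -> str:
    """Return an SSML string that nests three levels of expressive markup."""
    # <speak> is the standard SSML root element.
    speak = ET.Element("speak", {"version": "1.0"})

    # Abstract level (hypothetical tag): an emotion applied to a whole sentence.
    emotion = ET.SubElement(speak, "x-emotion", {"name": "good-news"})
    emotion.text = "Your flight has been "

    # Intermediate level (hypothetical tag): a ToBI pitch accent on one word.
    accent = ET.SubElement(emotion, "x-tobi", {"accent": "L+H*"})
    accent.tail = "."

    # Low level (standard SSML): a pitch contour that changes within the word.
    contour = "(0%,+10%) (50%,+30%) (100%,+5%)"
    prosody = ET.SubElement(accent, "prosody", {"contour": contour})
    prosody.text = "confirmed"

    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    print(build_layered_ssml())
```

The nesting mirrors the scoping in the abstract: the most abstract tag has the widest span, and the physical parameter is confined to a single word. A speech expert could edit the `contour` values directly, while a less technical user would only touch the outer emotion tag.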
Similar resources
The IBM expressive speech synthesis system
This paper introduces the IBM Expressive Speech Synthesis system. We describe recent work in improving the quality of our baseline text-to-speech system as well as extending our capabilities to generate expressive synthetic speech. We present results showing improved base quality, especially for sentences drawn from a limited domain. We also demonstrate our ability to convey good news and bad n...
The Concept of Speech Synthesis Markup Language
Synthetic speech that sounds close to natural can be heard nowadays. Recent advances in multimedia interfaces between man and machine have greatly increased interest in the Speech Synthesis Markup Language, which can be used to control a speech synthesis system to generate more expressive speech and greatly extends its functions in human-machine interaction. As we know, the speech synthesis syste...
A Corpus-based Approach to <ahem/> Expressive Speech Synthesis
Human speech communication can be thought of as comprising two channels – the words themselves, and the style in which they are spoken. Each of these channels carries information. Today's most-advanced text-to-speech (TTS) systems such as [1],[2],[3],[4] fall far short of human speech because they offer only a single, fixed style of delivery, independent of the message. In this paper, we descri...
SSML: A speech synthesis markup language
This paper describes the Speech Synthesis Markup Language, SSML, which has been designed as a platform-independent interface standard for speech synthesis systems. The paper discusses the need for standardisation in speech synthesizers and how this will help builders of systems make better use of synthesis. The SGML-based markup language is then discussed, and details of the Edinburgh SSML inte...
SSML Extensions Aimed To Improve Asian Language TTS Rendering
Both formant-synthesis-based and concatenative acoustic-unit-based TTS systems have been developed at Nokia. Many non-English languages have been considered in the development work, and Nokia's Mandarin Chinese TTS system is under continuous development within the TC-STAR framework (www.tc-star.org). To meet the needs of the TTS evaluations in TC-STAR, common interfaces for the input and all t...